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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/763 , 076A 



DATE: 06/29/2001 
TIME: 12:10:06 




3 
4 
5 
6 
7 
9 
11 

C--> 13 
C--> 14 

16 
17 
19 
20 
22 
23 
25 
27 
29 
30 
31 
32 
34 
35 
36 
37 
38 
39 
40 
41 
42 
46 
47 
48 
49 
51 



. Input Set : A:\PPD50348 US SEQ LIST.txt 
Output Set: N:\CRF3\06292001\I763076A.raw 

<110> APPLICANT: Broekaert, Willem 
Francois , Isabelle 
Evans, Ian 
De Bolle, Miguel 
Ray, John 

TITLE OF INVENTION: Genetic Method For The Expression of Polyproteins in Plants 



<120> 
<130> 
<140> 
<141> 

<150> 
<151> 
<150> 
<151> 
<150> 
<151> 
<160> 
<170> 
<210> 
<211> 
<212> 
<213> 
<400> 

atggtgaatc 
tcaggttatc 
ttttatgtgt 
tatgcgagaa 
accaatgtaa 
acatgtgttt 
aagccgaaca 
aagtggttcc 
<210> SEQ 
<211> 
<212> 
<213> 
<400> 



FILE REFERENCE: PPD50348/UST 

CURRENT APPLICATION NUMBER: US/09/763 , 076A 
CURRENT FILING DATE: 1999-08-17 

PRIOR APPLICATION NUMBER: GB 9818001.1 
PRIOR FILING DATE: 1998-08-18 
PRIOR APPLICATION NUMBER: GB 9826753.7 
PRIOR FILING DATE: 1998-12-14 
PRIOR APPLICATION NUMBER: PCT/GB99/02716 
PRIOR FILING DATE: 1999-08-17 
NUMBER OF SEQ ID NOS : 81 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO: 1 
LENGTH: 446 
TYPE: DNA 

ORGANISM: Dahlia merckii 
SEQUENCE: 1 

gttctccgcg 
ttcatttatt 
tgcaaatatt 
acatggtcgg 
ggtgcggccc 



ENTERED 



ggtcggttgc 
aaatctttag 
tctgacaagt 
agctagcaag 
atcatgggag 
ctgttacttc 
actcgctcaa 
aaacgttgaa 
ID NO: 2 
LENGTH: 118 
TYPE: PRT 
ORGANISM: 
SEQUENCE: 



ttcgttctga tccttttcgt gctcgccatc 60 

gaatatgata gtatttatat tcttttatgg 120 

gagtagatat cgcatccgtt agtggagaac 180 

gaaactgtgg caatacggga cattgtgaca 240 

atggagcgtg tcatgtgcgt aacgggaaac 300 

aattgtaaaa aagccgaaaa gcttgctcaa gacaaactta 360 

gacaaactta atgcccaaaa gcttgaccgt gatgccaaga 420 
catccg 446 



Dahlia merckii 
2 



52 


Met 


Val 


Asn 


Arg 


Ser 


Val 


Ala 


Phe 


Ser 


Ala 


Phe 


Val 


Leu 


He 


Leu 


Phe 


53 


1 








5 










10 










15 




55 


Val 


Leu 


Ala 


He 


Ser 


Asp 


He 


Ala 


Ser 


Val 


Ser 


Gly 


Glu 


Leu 


Cys 


Glu 


56 








20 










25 










30 






58 


Lys 


Ala 


Ser 


Lys 


Thr 


Trp 


Ser 


Gly 


Asn 


Cys 


Gly 


Asn 


Thr 


Gly 


His 


Cys 


59 






35 










40 










45 








61 


Asp 


Asn 


Gin 


Cys 


Lys 


Ser 


Trp 


Glu 


Gly Ala 


Ala 


His 


Gly 


Ala 


Cys 


His 


62 




50 










55 










60 










64 


Val 


Arg 


Asn 


Gly 




His 


Met 


Cys 


Phe 


Cys 


Tyr 


Phe 


Asn 


Cys 


Lys 


Lys 


65 


65 










70 










75 










80 


67 


Ala 


Glu 


Lys 


Leu 


Ala 


Gin 


Asp 


Lys 


Leu 


Lys 


Ala 


Glu 


Gin 


Leu 


Ala 


Gin 


68 










85 










90 










95 
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RAW SEQUENCE LISTING DATE: 06/29/2001 

PATENT APPLICATION: US/09/763 , 076A TIME: 12:10:06 

Input Set : A:\PPD50348 US SEQ LIST.txt 
Output Set: N:\CRF3\06292001\I763076A.raw 

70 Asp Lys Leu Asn Ala Gin Lys Leu Asp Arg Asp Ala Lys Lys Val Val 

71 100 105 110 

73 Pro Asn Val Glu His Pro 

74 115 

78 <210> SEQ ID NO: 3 

79 <211> LENGTH: 16 

80 <212> TYPE: PRT 

81 <213> ORGANISM: Artificial Sequence 

83 <220> FEATURE: 

84 <223> OTHER INFORMATION: Description of Artificial Sequence: Linker 

85 propeptide 

87 <400> SEQUENCE: 3 

88 Ser Asn Ala Ala Asp Glu Val Ala Thr Pro Glu Asp Val Glu Pro Gly 

89 1 5 10 15 

93 <210> SEQ ID NO: 4 

94 <211> LENGTH: 20 

95 <212> TYPE: PRT 

96 <213> ORGANISM: Artificial Sequence 

98 <220> FEATURE: 

99 <223> OTHER INFORMATION: Description of Artificial Sequence: Linker 

100 propeptide 

102 <400> SEQUENCE: 4 

103 Lys Lys Ala Glu Lys Leu Ala Gin Asp Lys Leu Lys Ala Glu Gin Leu 

104 15 10 15 

106 lie Gly Lys Arg 

107 20 

111 <210> SEQ ID NO: 5 

112 <211> LENGTH: 40 

113 <212> TYPE: PRT 

114 <213> ORGANISM: Dahlia merckii 

116 <400> SEQUENCE: 5 

117 Lys Lys Ala Glu Lys Leu Ala Gin Asp Lys Leu Lys Ala Glu Gin Leu 

118 15 10 15 

120 Ala Gin Asp Lys Leu Asn Ala Gin Lys Leu Asp Arg Asp Ala Lys Lys 

121 20 25 30 

123 Val Val Pro Asn Val Glu His Pro 

124 35 40 

128 <210> SEQ ID NO: 6 

129 <211> LENGTH: 44 

130 <212> TYPE: PRT 

131 <213> ORGANISM: Artificial Sequence 

133 <220> FEATURE: 

134 <223> OTHER INFORMATION: Description of Artificial Sequence: Linker 

135 propeptide 

137 <400> SEQUENCE: 6 

138 Lys Lys Ala Glu Lys Leu Ala Gin Asp Lys Leu Lys Ala Glu Gin Leu 

139 15 10 15 

141 Ala Gin Asp Lys Leu Asn Ala Gin Lys Leu Asp Arg Asp Ala Lys Lys 

142 20 25 30 
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RAW SEQUENCE LISTING DATE: 06/29/2001 

PATENT APPLICATION: US/09/763, 076A TIME: 12:10:06 

Input Set : A:\PPD50348 US SEQ LIST.txt 
Output Set: N:\CRF3\06292001\I763076A.raw 

144 Val Val Pro Asn Val Glu His Pro lie Gly Lys Arg 

145 35 40 

149 <210> SEQ ID NO: 7 

150 <211> LENGTH: 20 

151 <212> TYPE: PRT 

152 <213> ORGANISM: Artificial Sequence 

154 <220> FEATURE: 

155 <223> OTHER INFORMATION: Description of Artificial Sequence: Linker 

156 propeptide 

158 <400> SEQUENCE: 7 

159 Ala Ser Thr Thr Val Asp His Gin Ala Asp Val Ala Ala Thr Lys Thr 

160 15 10 15 

162 He Gly Lys Arg 

163 20 

167 <210> SEQ ID NO: 8 

168 <211> LENGTH: 31 

169 <212> TYPE: PRT 

170 <213> ORGANISM: Amaranthus caudatus 

173 <400> SEQUENCE: 8 

174 Ala Ser Thr Thr Val Asp His Gin Ala Asp Val Ala Ala Thr Lys Thr 

175 15 10 15 

177 Ala Lys Asn Pro Thr Asp Ala Lys Leu Ala Gly Ala Gly Ser Pro 

178 20 25 30 

182 <210> SEQ ID NO: 9 

183 <211> LENGTH: 522 

184 <212> TYPE: DNA 

185 <213> ORGANISM: Artificial Sequence 

187 <220> FEATURE: 

188 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

189 sequence 

191 <220> FEATURE: 

192 <221> NAME/KEY: CDS 

193 <222> LOCATION: (76).. (513) 

195 <400> SEQUENCE: 9 

196 ctcgagtatt tttacaacaa ttaccaacaa caacaaacaa caaacaacat tacaattact 60 

198 atttacaatt acacc atg gtg aat egg teg gtt gcg ttc tec gcg ttc gtt 111 

199 Met Val Asn Arg Ser Val Ala Phe Ser Ala Phe Val 

200 1 5 10 



202 


ctg 


ate 


ctt 


ttc 


gtg 


etc 


gee 


ate 


tea 


gat 


ate 


gca 


tec 


gtt 


agt 


gga 


159 


203 


Leu 


He 


Leu 


Phe 


Val 


Leu 


Ala 


He 


Ser 


Asp 


He 


Ala 


Ser 


Val 


Ser 


Gly 




204 






15 










20 










25 










206 


gaa 


eta 


tgc 


gag 


aaa 


get 


age 


aag 


acg 


tgg 


teg 


ggc 


aac 


tgt 


ggc 


aac 


207 


207 


Glu 


Leu 


Cys 


Glu 


L ys 


Ala 


Ser 


Lys 


Thr 


Trp 


Ser 


Gly 


Asn 


Cys 


Gly 


Asn 




208 




30 










35 










40 












210 


acg 


gga 


cat 


tgt 


gac 


aac 


caa 


tgt 


aaa 


tea 


tgg 


gag 


ggt 


gcg 


gee 


cat 


255 


211 


Thr 


Gly 


His 


Cys 


Asp 


Asn 


Gin 


Cys 


Lys 


Ser 


Trp 


Glu 


Gly 


Ala 


Ala 


His 




212 


45 










50 










55 










60 




214 


gga 


gcg 


tgt 


cat 


gtg 


cgt 


aac 


ggg 


aaa 


cac 


atg 


tgt 


ttc 


tgt 


tac 


ttc 


303 


215 


Gly 


Ala 


Cys 


His 


Val 


Arg 


Asn 


Gly 


Lys 


His 


Met 


Cys 


Phe 


Cys 


Tyr 


Phe 
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RAW SEQUENCE LISTING DATE: 06/29/2001 

PATENT APPLICATION: US/09/763 f 076A TIME: 12:10:06 

Input Set : A:\PPD50348 US SEQ LIST.txt 
Output Set: N:\CRF3\06292001\I763076A.raw 



216 










65 










70 










75 






218 


aat 


tgt 


tec 


aac 


get 


get 


gac 


gag 


gtg 


get 


ace 


cca 


gag 


gac 


gtg 


gag 


351 


219 


Asn 


Cys 


Ser 


Asn 


Ala 


Ala 


Asp 


Glu 


Val 


Ala 


Thr 


Pro 


Glu 


Asp 


Val 


Glu 




220 








80 










85 










90 








222 


cca 


gga 


cag 


aag 


ttg 


tgc 


caa 


agg 


cca 


agt 


ggg 


aca 


tgg 


tea 


gga 


gtc 


399 


223 


Pro 


Gly 


Gin 


Lys 


Leu 


Cys 


Gin 


Arg 


Pro 


Ser 


Gly 


Thr 


Trp 


Ser 


Gly 


Val 




224 






95 










100 










105 










226 


tgt 


gga 


aac 


aat 


aac 


gca 


tgc 


aag 


aat 


cag 


tgc 


att 


aga 


ctt 


gag 


aaa 


447 


227 


Cys 


Gly 


Asn 


Asn 


Asn 


Ala 


Cys 


Lys 


Asn 


Gin 


Cys 


He 


Arg 


Leu 


Glu 


Lys 




228 




110 










115 










120 












230 


gca 


cga 


cat 


gga 


tct 


tgc 


aac 


tat 


gtc 


ttc 


cca 


get 


cac 


aag 


tgt 


ate 


495 


231 


Ala 


Arg 


His 


Gly 


Ser 


Cys 


Asn 


Tyr 


Val 


Phe 


Pro 


Ala 


His 


Lys 


Cys 


He 




232 


125 










130 










135 










140 




234 


tgc 


tac 


ttt 


cct 


tgt 


taa 


taggagctc 
















522 


235 


Cys 


Tyr 


Phe 


Pro 


Cys 



























236 145 

239 <210> SEQ ID NO: 10 

240 <211> LENGTH: 145 

241 <212> TYPE: PRT 

242 <213> ORGANISM: Artificial Sequence 

244 <220> FEATURE: 

245 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

246 sequence 
248 <400> SEQUENCE: 10 



249 


Met 


Val 


Asn 


Arg 


Ser 


Val 


Ala 


Phe 


Ser 


Ala 


Phe 


Val 


Leu 


He 


Leu 


Phe 


250 


1 








5 










10 










15 




252 


Val 


Leu 


Ala 


He 


Ser 


Asp 


He 


Ala 


Ser 


Val 


Ser 


Gly 


Glu 


Leu 


Cys 


Glu 


253 








20 










25 










30 






255 


Lys 


Ala 


Ser 


Lys 


Thr 


Trp 


Ser 


Gly 


Asn 


Cys 


Gly 


Asn 


Thr 


Gly 


His 


Cys 


256 






35 










40 










45 








258 


A sp 


Asn 


Gin 


Cys 


Lys 


Ser 


Trp 


Glu 


Gly 


Ala 


Ala 


His 


Gly 


Ala 


Cys 


His 


259 




50 










55 










60 










261 


Val 


Arg 


Asn 


Gly 


Lys 


His 


Met 


Cys 


Phe 


Cys 


Tyr 


Phe 


Asn 


Cys 


Ser 


Asn 


262 


65 










70 










75 










80 


264 


Ala 


Ala 


Asp 


Glu 


Val 


Ala 


Thr 


Pro 


Glu 


Asp 


Val 


Glu 


Pro 


Gly 


Gin 


Lys 


265 










85 










90 










95 




267 


Leu 


Cys 


Gin 


Arg 


Pro 


Ser 


Gly 


Thr 


Trp 


Ser 


Gly 


Val 


Cys 


Gly 


Asn 


Asn 


268 








100 










105 










110 






270 


Asn 


Ala 


Cys 


Lys 


Asn 


Gin 


Cys 


He 


Arg 


Leu 


Glu 


Lys 


Ala 


Arg 


His 


Gly 


271 






115 










120 










125 








273 


Ser 


Cys 


Asn 


Tyr 


Val 


Phe 


Pro 


Ala 


His 


Lys 


Cys 


He 


Cys 


Tyr 


Phe 


Pro 



274 130 135 140 

276 Cys 

277 145 

281 <210> SEQ ID NO: 11 

282 <211> LENGTH: 534 

283 <212> TYPE: DNA 

284 <213> ORGANISM: Artificial Sequence 
286 <220> FEATURE: 
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ir 



287 
288 
290 
291 
292 
294 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/7 63 f 076A 



DATE: 06/29/2001 
TIME: 12:10:06 



Input Set : A:\PPD50348 US SEQ LIST.txt 
Output Set: N:\CRF3\06292001\I763076A.raw 



<223> OTHER INFORMATION: 
sequence 



Description of Artificial Sequence: Synthetic 



<220> 
<221> 
<222> 
<400> 



FEATURE : 
NAME/KEY 
LOCATION 
SEQUENCE 



CDS 

(76). .(525) 
11 

295 ctcgagtatt tttacaacaa ttaccaacaa caacaaacaa caaacaacat tacaattact 
297 atttacaatt acacc atg gtg aat egg teg gtt gcg ttc tec gcg ttc gtt 

'Met Val Asn Arg Ser 
1 5 



60 
111 



298 
299 
301 
302 
303 
305 
306 
307 
309 
310 
311 
313 
314 
315 
317 
318 
319 
321 
322 
323 



327 
329 
330 
331 
333 
334 
335 
338 
339 
340 
341 
343 
344 
345 
347 
348 
349 
351 
352 



Val Ala Phe Ser 



Ala 
10 



Phe Val 



ctg 


ate 


ctt 


ttc 


gtg 


etc 


gee 


ate 


tea 


gat 


ate 


gca 


tec 


gtt 


agt 


gga 


159 


Leu 


He 


Leu 


Phe 


Val 


Leu 


Ala 


He 


Ser 


Asp 


He 


Ala 


Ser 


Val 


Ser Gly 








15 










20 










25 










gaa 


eta 


tgc 


gag 


aaa 


get 


age 


aag 


acg 


tgg 


teg 


ggc 


aac 


tgt 


ggc 


aac 


207 


Glu 


Leu 


Cys 


Glu 


Lys 


Ala 


Ser 


Lys 


Thr 


Trp 


Ser 


Gly 


Asn 


Cys 


Gly 


Asn 






30 










35 










40 












acg 


gga 


cat 


tgt 


gac 


aac 


caa 


tgt 


aaa 


tea 


tgg 


gag 


ggt 


gcg 


gee 


cat 


255 


Thr 


Gly 


His 


Cys 


Asp 


Asn 


Gin 


Cys 


Lys 


Ser 


Trp 


Glu 


Gly 


Ala 


Ala 


His 




45 








50 










55 










60 




gga 


gcg 


tgt 


cat 


gtg 


cgt 


aac 


ggg 


aaa 


cac 


atg 


tgt 


ttc 


tgt 


tac 


ttc 


303 


Gly 


Ala 


Cys 


His 


Val 


Arg 


Asn 


Gly 


Lys 


His 


Met 


Cys 


Phe 


Cys 


Tyr 


Phe 










65 










70 










75 






aat 


tgt 


aaa 


aaa 


gee 


gaa 


aag 


ctt 


get 


caa 


gac 


aaa 


ctt 


aaa 


gee 


gaa 


351 


Asn 


Cys 


Lys 


Lys 


Ala 


Glu 


Lys 


Leu 


Ala 


Gin 


Asp 


Lys 


Leu 


Lys 


Ala 


Glu 






80 










85 










90 








caa 


etc 


ate 


gga 


aag 


agg 


cag 


aag 


ttg 


tgc 


caa 


agg 


cca 


agt 


ggg 


aca 


399 


Gin 


Leu 


He 


Gly 


Lys 


Arg 


Gin 


Lys 


Leu 


Cys 


Gin 


Arg 


Pro 


Ser 


Gly 


Thr 








95 










100 










105 










tgg 


tea 


gga 


gtc 


tgt 


gga 


aac 


aat 


aac 


gca 


tgc 


aag 


aat 


cag 


tgc 


att 


447 


Trp 


Ser 


Gly 


Val 


Cys 


Gly 


Asn 


Asn 


Asn 


Ala 


Cys 


Lys 


Asn 


Gin 


Cys 


He 






110 










115 










120 












aga 


ctt 


gag 


aaa 


gca 


cga 


cat 


gga 


tct 


tgc 


aac 


tat 


gtc 


ttc 


cca 


get 


495 


Arg 


Leu 


Glu 


Lys 


Ala 


Arg 


His 


Gly 


Ser 


Cys 


Asn 


Tyr 


Val 


Phe 


Pro 


Ala 




12 5 










130 










135 










140 




cac 


aag 


tgt 


ate 


tgc 


tac 


ttt 


cct 


tgt 


taa 


taggagctc 








534 


His 


Lys 


Cys 


He 


Cys 


Tyr 


Phe 


Pro 


Cys 



















<210> 
<211> 
<212> 
<213> 
<220> 
<223> 



145 

SEQ ID NO: 12 
LENGTH: 149 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence: 
sequence 
SEQUENCE: 



Synthetic 



12 



Phe Val Leu He 



Leu 
15 



Phe 



<400> 

Met Val Asn Arg Ser Val Ala Phe Ser Ala 
15 10 
Val Leu Ala He Ser Asp lie Ala Ser Val Ser Gly Glu Leu Cys Glu 



20 



25 



30 



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/763 , 076A 



DATE: 06/29/2001 
TIME: 12:10:07 



Input Set : A:\PPD50348 US SEQ LIST.txt 
Output Set: N:\CRF3\06292001\I763076A.raw 



L:13 M:270 C: Current Application Number differs, Replaced Application Number 
L:14 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:414 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 
L:2059 M:341 W: (46) "n" or n Xaa" used, for SEQ ID#:52 
L:2251 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:66 
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